AD-A243 840


NPS-OR-91-031


Monterey,  California 


DTIC ELECTE, DEC 31 1991


Approved  for  public  release;  distribution  is  unlimited. 


Prepared  for: 

Naval  Postgraduate  School 
Monterey,  CA 




NAVAL POSTGRADUATE SCHOOL
MONTEREY, CALIFORNIA


Rear Admiral R. W. West, Jr.          Harrison Shull
Superintendent                        Provost


This report was prepared in conjunction with research funded under the Naval
Postgraduate School Research Council Research Program.

This  report  was  prepared  by: 


ALAN R. WASHBURN
Professor  of  Operations  Research 

Reviewed  by: 


Released  by: 


Department of Operations Research


UNCLASSIFIED
SECURITY CLASSIFICATION OF THIS PAGE

REPORT DOCUMENTATION PAGE                    Form Approved, OMB No. 0704-0188

1a. REPORT SECURITY CLASSIFICATION: UNCLASSIFIED
1b. RESTRICTIVE MARKINGS:
2a. SECURITY CLASSIFICATION AUTHORITY:
2b. DECLASSIFICATION/DOWNGRADING SCHEDULE:
3.  DISTRIBUTION/AVAILABILITY OF REPORT: Approved for public release; distribution is unlimited.
4.  PERFORMING ORGANIZATION REPORT NUMBER(S): NPS-OR-91-031
5.  MONITORING ORGANIZATION REPORT NUMBER(S):
6b. OFFICE SYMBOL (If applicable): OR
6c. ADDRESS (City, State, and ZIP Code): Monterey, CA 93943
7a. NAME OF MONITORING ORGANIZATION: O&MN, Direct Funding
7b. ADDRESS (City, State, and ZIP Code): Monterey, CA 93943-5006
8a. NAME OF FUNDING/SPONSORING ORGANIZATION: Naval Postgraduate School
8b. OFFICE SYMBOL (If applicable):
8c. ADDRESS (City, State, and ZIP Code): Monterey, CA 93943
9.  PROCUREMENT INSTRUMENT IDENTIFICATION NUMBER:
10. SOURCE OF FUNDING NUMBERS (PROGRAM ELEMENT NO., PROJECT NO., TASK NO., WORK UNIT ACCESSION NO.):
11. TITLE (Include Security Classification): More on Cumulative Search Evasion Games
12. PERSONAL AUTHOR(S): Alan R. Washburn
13a. TYPE OF REPORT: Technical Report
13b. TIME COVERED: From ____ to ____
14. DATE OF REPORT (Year, month, day): October, 1991
15. PAGE COUNT: 25
16. SUPPLEMENTARY NOTATION:
17. COSATI CODES (FIELD, GROUP, SUB-GROUP):
18. SUBJECT TERMS (Continue on reverse if necessary and identify by block number): Search; evasion; game
19. ABSTRACT (Continue on reverse if necessary and identify by block number):

This report generalizes the form of the payoff function so that all track crossings must involve
contact. A more computationally efficient form of the one-dimensional game is also given.

20. DISTRIBUTION/AVAILABILITY OF ABSTRACT: UNCLASSIFIED/UNLIMITED
21. ABSTRACT SECURITY CLASSIFICATION: UNCLASSIFIED
22a. NAME OF RESPONSIBLE INDIVIDUAL: A. Washburn
22b. TELEPHONE (Include Area Code): (408) 646-2654
22c. OFFICE SYMBOL: OR/Er

DD Form 1473, JUN 86. Previous editions are obsolete.          SECURITY CLASSIFICATION OF THIS PAGE
S/N 0102-LF-014-6603                                           UNCLASSIFIED

MORE  ON  CUMULATIVE  SEARCH  EVASION  GAMES 


1.  INTRODUCTION 


Eagle and Washburn (1990) introduced Cumulative Search Evasion
Games (CSEGs) as two-person zero-sum games where the cumulative payoff
over $T$ time periods is $\sum_{t=1}^{T} A(x_t, y_t, t)$, $x_t$ and $y_t$ being the
locations of searcher and evader, respectively, at time $t$. A path for the
searcher is a sequence $x_1, \ldots, x_T$ where $x_1 \in S_0$ and $x_{t+1} \in S(x_t, t)$ for
$t \ge 1$, the sets $S_0$ and $S(\cdot,\cdot)$ being given, and similarly for the evader
except $y_1 \in E_0$ and $y_{t+1} \in E(y_t, t)$. All of these sets are nonempty subsets
of $C$, a given finite set of "cells." A mixed strategy for the searcher is a
probability distribution over paths. Let $p(x,t)$ be the corresponding marginal
distribution, the probability that the searcher occupies cell $x$ at time $t$, and
let $q(y,t)$ be defined similarly for the evader. Then the expected payoff is
$\sum_{x,y,t} A(x,y,t)\,p(x,t)\,q(y,t)$. This observation, together with the
observation that the optimization problem for one player when the marginal
distribution of the other is given is a shortest- or longest-path problem,
formed the basis of two solution methods for solving CSEGs: Fictitious Play
and Linear Programming (LP). Only the LP method will be discussed here.
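As a small illustration of the observation that the marginals suffice to express the expected payoff, the following sketch evaluates the triple sum directly. The function name `expected_payoff`, the nested-dictionary layout, and the 3-cell, 2-period data are assumptions made for illustration only; they do not come from Eagle and Washburn.

```python
def expected_payoff(A, p, q, cells, T):
    """Sum of A(x, y, t) * p(x, t) * q(y, t) over all cells x, y and times t."""
    return sum(A(x, y, t) * p[t][x] * q[t][y]
               for t in range(1, T + 1) for x in cells for y in cells)

# Hypothetical data: 3 cells, 2 periods, payoff 1 when both occupy the same cell.
cells = [1, 2, 3]
T = 2
coincide = lambda x, y, t: 1.0 if x == y else 0.0
p = {1: {1: 1.0, 2: 0.0, 3: 0.0}, 2: {1: 0.5, 2: 0.5, 3: 0.0}}  # searcher marginals
q = {1: {1: 0.0, 2: 0.0, 3: 1.0}, 2: {1: 0.0, 2: 0.5, 3: 0.5}}  # evader marginals

value = expected_payoff(coincide, p, q, cells, T)
print(value)
```

Only cell 2 at time 2 carries probability for both players in this example, so the sum reduces to a single product of the two marginals there.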

One might hope to formulate an LP for the searcher in which the only
variables needed to describe the searcher's mixed strategy are $p(x,t)$, since
those suffice to express the expected payoff. However, Eagle and Washburn
found it necessary to introduce the joint probabilities

$u(i,j,t)$ = probability that the searcher occupies cell $i$ at time
$t-1$, and cell $j$ at time $t$,

together with network constraints to the effect that probabilities "flowing"
into and out of a cell must balance. The necessity to include these joint
probabilities is disappointing, since in large problems there are many more
$u$-variables than $p$-variables. One of the goals of this paper is to show that the
$u$-variables can be avoided in certain one-dimensional CSEGs. This is the
subject of the next section. Using only the $p$-variables makes it possible to
solve larger CSEGs than would otherwise be possible.

The other goal of this paper is to show that the payoff at time $t$ in a CSEG
can be generalized to $A(x_{t-1}, x_t, y_{t-1}, y_t, t)$ if $u$-variables are retained. The
required theorems and LP formulation, together with an example illustrating
the value of the generalization, are the subject of Section 3.

2.  THE  ONE-DIMENSIONAL  CSEG 

In this section the positions of both parties must at all times be in the set of
cells $C = \{1, \ldots, N\}$, $N \ge 1$, with transitions from $i$ to $j$ at $t$ being
permissible if $i \in C$, $j \in C$, and $|i - j| \le 1$. These rules define $E(\cdot,\cdot)$
and $S(\cdot,\cdot)$. The payoff function $A(i,j,t)$ is unrestricted.

Suppose for the moment that the searcher's marginal probabilities $p(i,t)$
were known to the evader, in which case any evader path that visits cell $j$ at
time $t$ must pay a penalty ("penalty" because the evader is the minimizer) of
$\sum_{i \in C} p(i,t)\,A(i,j,t)$. Let $g(j,t)$ be the minimum possible cumulative
payoff from time $t$ onwards, given that the evader occupies cell $j$ at time $t$.
Then, taking $g(\cdot, T+1) \equiv 0$ for convenience, $g(\cdot,\cdot)$ must satisfy the recursion

\[
g(j,t) = \sum_{i \in C} p(i,t)\,A(i,j,t) + \min_{k \in E(j,t)} g(k,t+1); \quad j \in C,\ 1 \le t \le T. \tag{1}
\]

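Since the evader must be in $E_0$ at time 1, the minimum possible payoff is
$\min_{j \in E_0} g(j,1)$, which the searcher wants to maximize. This leads to the
following Linear Program:

\[
\begin{array}{ll}
\text{maximize} & g_0 \\
\text{subject to} & g_0 - g(j,1) \le 0; \quad j \in E_0, \\
 & g(j,t) - \sum_{i \in C} p(i,t)\,A(i,j,t) - g(k,t+1) \le 0; \quad j \in C,\ 1 \le t \le T,\ k \in E(j,t),
\end{array}
\]

and some feasibility constraints on $p(\cdot,\cdot)$.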

Eagle and Washburn employed the $u$-variables in expressing the
feasibility constraints on $p(\cdot,\cdot)$. The object here is to find a way of
expressing those constraints without defining any new variables. First we
prove

Theorem 1. In the one-dimensional CSEG, $p(\cdot,\cdot)$ is feasible if the following
feasibility constraints hold:

\[
\begin{array}{lll}
 & \sum_{i \in S_0} p(i,1) = 1 & \\
\text{(left)} & \sum_{i=1}^{k} p(i,t+1) - \sum_{i=1}^{k+1} p(i,t) \le 0; & 1 \le k < N;\ 1 \le t < T \\
\text{(right)} & \sum_{i=k+1}^{N} p(i,t+1) - \sum_{i=k}^{N} p(i,t) \le 0; & 1 \le k < N;\ 1 \le t < T \\
 & \sum_{i=1}^{N} p(i,t) = 1; & 1 < t \le T \\
 & p(i,t) \ge 0; & 1 \le i \le N;\ 1 \le t \le T
\end{array}
\]

Proof: Assume that the feasibility constraints hold, and consider the
proposition $P_\tau$ that there exists a feasible stochastic searcher motion process
for which the marginal distributions are $p(\cdot,t)$; $1 \le t \le \tau$. $P_1$ is clearly true,
since the feasibility constraints in that case require only that the searcher begin
in $S_0$. If it can be shown that $P_\tau$ implies $P_{\tau+1}$, the theorem will be established
by induction. Toward this end, let cells $1, \ldots, N$ at time $\tau$ be "sources" with
probability $p_i \equiv p(i,\tau)$ each, and let the same cells at time $\tau+1$ be "sinks" with
probability $q_i \equiv p(i,\tau+1)$ each. To establish $P_{\tau+1}$, it is sufficient to show that
there exist $N^2$ joint occupancy probabilities $u_{ij}$ such that
$\sum_j u_{ij} = p_i$, $\sum_i u_{ij} = q_j$, and $u_{ij} = 0$ unless $j \in E(i,\tau)$, the latter constraint
reflecting the requirement that transitions beyond neighboring cells are not
allowed. In other words, it must be possible to "ship" a unit of probability
from sources to sinks, with $u_{ij}$ being the amount shipped from source $i$ to sink
$j$. The "left biased" method (LB) below is one constructive method for
accomplishing this. LB proceeds through the sources in increasing order,
shipping probability to the lowest numbered sink that is not yet satisfied until
the source being considered is exhausted, then proceeding to the next source
until all $N$ sources have been considered. If LB makes $u_{ij} > 0$ for some $i$ and
some $j < i-1$ (alternatively $j > i+1$), we say that a left (alternatively right)
difficulty occurs at node $i$. To complete the proof it is required to show that
no difficulties of either type can occur as long as the feasibility constraints
hold.

Suppose that no difficulties occur in cells $1, \ldots, k-1$, but that a left
difficulty occurs in cell $k$ (necessarily $k \ge 3$, since left difficulties are not
possible in cells 1 and 2). Since all of the probability in sources $1, \ldots, k-1$ can
be shipped to sinks $1, \ldots, k-2$ without satisfying one of those sinks (otherwise
the left difficulty could not occur in cell $k$), necessarily

\[
\sum_{i=1}^{k-2} q_i - \sum_{i=1}^{k-1} p_i > 0.
\]

But this inequality is in the opposite sense of one of the left constraints, so a
left difficulty cannot occur in cell $k$. Suppose instead that there is a right
difficulty. A right difficulty occurs for the first time in cell $k$ only if there is
more probability in sources $1, \ldots, k$ than is required to satisfy sinks $1, \ldots, k+1$,
so

\[
\sum_{i=1}^{k+1} q_i - \sum_{i=1}^{k} p_i < 0.
\]

Since $\{p_i\}$ and $\{q_i\}$ are both constrained to be probability distributions, it
follows that

\[
\sum_{i=k+2}^{N} q_i - \sum_{i=k+1}^{N} p_i > 0.
\]

But this contradicts one of the right constraints, so right difficulties cannot
occur either.

Since neither right nor left difficulties can occur, LB will discover a
feasible set of joint probabilities $u_{ij}$. This completes the proof. □
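The left-biased construction used in the proof is easy to state as code. The following is a minimal sketch for a single time step, assuming 0-based cell indices and plain Python lists; the names `left_biased` and `satisfies_left_right`, the tolerance, and the example distributions are all illustrative assumptions rather than anything taken from the report.

```python
def left_biased(p, q, tol=1e-12):
    """Ship probability from sources p to sinks q, lowest unsatisfied sink first.

    Returns u with u[i][j] = amount shipped from source i to sink j.
    """
    n = len(p)
    u = [[0.0] * n for _ in range(n)]
    need = list(q)                      # unmet demand at each sink
    j = 0
    for i in range(n):
        supply = p[i]
        while supply > tol:
            while j < n and need[j] <= tol:
                j += 1                  # skip sinks that are already satisfied
            if j == n:
                break                   # only rounding residue is left
            amount = min(supply, need[j])
            u[i][j] += amount
            supply -= amount
            need[j] -= amount
    return u

def satisfies_left_right(p, q, tol=1e-12):
    """Left and right constraints of Theorem 1 for one time step."""
    n = len(p)
    left = all(sum(q[:k]) <= sum(p[:k + 1]) + tol for k in range(1, n))
    right = all(sum(q[k:]) <= sum(p[k - 1:]) + tol for k in range(1, n))
    return left and right

# Hypothetical marginals at two consecutive times on 4 cells.
p = [0.5, 0.5, 0.0, 0.0]
q = [0.25, 0.5, 0.25, 0.0]
u = left_biased(p, q)
print(satisfies_left_right(p, q), u)
```

In this example the constraints hold and every positive entry of `u` lies on or next to the diagonal, which is the "no difficulties" conclusion of the proof. A pair such as `p = [1, 0, 0]`, `q = [0, 0, 1]` violates a right constraint, and no neighbor-only shipment exists for it.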

Obviously there is a symmetrically defined "right-biased method" that
will discover a possibly different set of feasible joint probabilities. In fact
there are many such methods and many feasible sets of joint probabilities.
Formulating the searcher's linear program without reference to these joint
probabilities has the advantage of eliminating many alternate optima, in
addition to the computational savings achieved by eliminating variables. The
revised formulation, with dual variables shown in braces, is program LP:

\[
\begin{array}{lll}
\text{maximize } g_0 & & \\
\text{subject to} & & \\
g_0 - g(j,1) \le 0 & ;\ j \in E_0 & \{q(j,1)\} \\
g(j,t) - \sum_{i=1}^{N} p(i,t)\,A(i,j,t) - g(k,t+1) \le 0 & ;\ j \in C,\ 1 \le t < T,\ k \in E(j,t) & \{v(j,k,t+1)\} \\
g(j,T) - \sum_{i=1}^{N} p(i,T)\,A(i,j,T) \le 0 & ;\ j \in C & \{q(j,T)\} \\
\sum_{i \in S_0} p(i,1) = 1 & & \{h_1\} \\
\sum_{i=1}^{k} p(i,t+1) - \sum_{i=1}^{k+1} p(i,t) \le 0 & ;\ 1 \le k < N,\ 1 \le t < T & \{l(k,t)\} \\
\sum_{i=k+1}^{N} p(i,t+1) - \sum_{i=k}^{N} p(i,t) \le 0 & ;\ 1 \le k < N,\ 1 \le t < T & \{r(k,t)\} \\
\sum_{i=1}^{N} p(i,t) = 1 & ;\ 1 < t \le T & \{h_t\} \\
p(i,t) \ge 0 & &
\end{array}
\]


It has been established so far that the value, $v$, of the CSEG is at least $g_0$.
The possibility still remains that $v > g_0$. To establish $v = g_0$, the dual of LP will
be shown to be a Linear Program whose objective function is an upper bound
on the game value. Consideration of the dual will also provide
interpretations of the dual variables in LP; the notation used above anticipates
that $q(j,1)$ can be interpreted as the probability described earlier, for example,
but that fact has yet to be established formally.

The dual of LP involves the sums $\sum_{k \ge i} l(k,t) \equiv L(i,t)$ and
$\sum_{k < i} r(k,t) \equiv R(i,t)$. For compactness we will write $L(\cdot,\cdot)$ and $R(\cdot,\cdot)$ below,
even though the sums are actually meant, and we will also use the convention
that $L(0,t) \equiv L(1,t)$ and $R(N+1,t) \equiv R(N,t)$. Note that, since $l(\cdot,\cdot)$ and $r(\cdot,\cdot)$ are
nonnegative, $L(\cdot,t)$ and $R(\cdot,t)$ are nonincreasing and nondecreasing cell
functions, respectively, for $1 \le t < T$. Finally, the set $E^*(i,t)$ consists of those
cells from which the evader at time $t-1$ can transition to cell $i$ at time $t$. The
dual of LP is DLP:

\[
\begin{array}{lll}
\text{minimize } \sum_{t=1}^{T} h_t & & \\
\text{subject to} & & \\
h_1 - \sum_{j=1}^{N} A(i,j,1) \sum_{k \in E(j,1)} v(j,k,2) - L(i-1,1) - R(i+1,1) \ge 0 & ;\ i \in S_0 & \{p(i,1)\} \\
h_t - \sum_{j=1}^{N} A(i,j,t) \sum_{k \in E(j,t)} v(j,k,t+1) + L(i,t-1) + R(i,t-1) - L(i-1,t) - R(i+1,t) \ge 0 & ;\ i \in C,\ 1 < t < T & \{p(i,t)\} \\
h_T - \sum_{j=1}^{N} A(i,j,T)\,q(j,T) + L(i,T-1) + R(i,T-1) \ge 0 & ;\ i \in C & \{p(i,T)\} \\
\sum_{k \in E(j,t)} v(j,k,t+1) - \sum_{k \in E^*(j,t)} v(k,j,t) = 0 & ;\ j \in C,\ 1 < t < T & \{g(j,t)\} \\
q(k,T) - \sum_{j \in E^*(k,T)} v(j,k,T) = 0 & ;\ k \in C & \{g(k,T)\} \\
\sum_{k \in E(j,1)} v(j,k,2) - q(j,1) = 0 & ;\ j \in C & \{g(j,1)\} \\
\sum_{j \in E_0} q(j,1) = 1 & & \{g_0\} \\
l(\cdot,\cdot) \ge 0;\ r(\cdot,\cdot) \ge 0;\ v(\cdot,\cdot,\cdot) \ge 0;\ q(\cdot,\cdot) \ge 0 & &
\end{array}
\]
The last four sets of constraints in DLP have the effect of requiring that
$q(\cdot,\cdot)$ be a feasible marginal distribution for the evader, with $v(\cdot,\cdot,\cdot)$ being
the joint occupancy probabilities. The first three sets of constraints can be
simplified somewhat by defining $y(i,t) \equiv \sum_{j=1}^{N} A(i,j,t)\,q(j,t)$, where
$q(j,t) \equiv \sum_{k \in E(j,t)} v(j,k,t+1)$ for $t < T$, so
that $y(i,t)$ is the average payoff to the searcher at time $t$ if he occupies cell $i$ at
that time, and also $L(\cdot,T) \equiv R(\cdot,T) \equiv 0$. In that case the first three sets of
constraints can be summarized as

\[
h_1 - y(i,1) - L(i-1,1) - R(i+1,1) \ge 0; \quad i \in S_0 \tag{2}
\]

\[
h_t - y(i,t) + L(i,t-1) + R(i,t-1) - L(i-1,t) - R(i+1,t) \ge 0; \quad i \in C,\ 1 < t \le T \tag{3}
\]

The question now is, "Do (2) and (3) guarantee that the accumulated payoff is
at most $\sum_{t=1}^{T} h_t$ for any feasible searcher path?" Theorem 2 answers this
question in the affirmative.

Theorem 2: Suppose that (2) and (3) hold, with $L(\cdot,t)$ and $R(\cdot,t)$ being
nonincreasing and nondecreasing functions, respectively, on $\{1, \ldots, N\}$, and
$L(\cdot,T) \equiv R(\cdot,T) \equiv 0$. Let $x_1, \ldots, x_T$ be any sequence of integers such that $x_1 \in S_0$,
$1 \le x_t \le N$ for $1 \le t \le T$, and $|x_t - x_{t-1}| \le 1$ for $t > 1$. Then
$\sum_{t=1}^{T} y(x_t,t) \le \sum_{t=1}^{T} h_t$.

Proof: Substitute $x_t$ for $i$ in the $t$-th inequality of (2)-(3), and sum all $T$
inequalities. The result is

\[
\sum_{t=1}^{T} h_t - \sum_{t=1}^{T} y(x_t,t) + \sum_{t=2}^{T} \Bigl\{ \bigl[ L(x_t,t-1) - L(x_{t-1}-1,t-1) \bigr] + \bigl[ R(x_t,t-1) - R(x_{t-1}+1,t-1) \bigr] \Bigr\} \ge 0 \tag{4}
\]

Since $L(\cdot,t-1)$ is nonincreasing and since $x_t \ge x_{t-1} - 1$, $L(x_t,t-1) - L(x_{t-1}-1,t-1) \le 0$ for
$t = 2, \ldots, T$. Similarly $R(x_t,t-1) - R(x_{t-1}+1,t-1) \le 0$. Therefore the third sum in
(4) is nonpositive, and the theorem follows directly. □
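The summation step in the proof is a reindexing, which the following lines spell out using only symbols already defined above. Summing (2) at $i = x_1$ and (3) at $i = x_t$ for $t = 2, \ldots, T$ gives the first line; the second line drops the $t = T$ term of the last sum (because $L(\cdot,T) \equiv R(\cdot,T) \equiv 0$) and replaces $t$ by $t-1$ in what remains. Pairing the two sums over $t = 2, \ldots, T$ term by term then yields (4).

```latex
\begin{align*}
0 &\le \sum_{t=1}^{T} h_t - \sum_{t=1}^{T} y(x_t,t)
     + \sum_{t=2}^{T} \bigl[ L(x_t,t-1) + R(x_t,t-1) \bigr]
     - \sum_{t=1}^{T} \bigl[ L(x_t-1,t) + R(x_t+1,t) \bigr] \\
  &=   \sum_{t=1}^{T} h_t - \sum_{t=1}^{T} y(x_t,t)
     + \sum_{t=2}^{T} \bigl[ L(x_t,t-1) + R(x_t,t-1) \bigr]
     - \sum_{t=2}^{T} \bigl[ L(x_{t-1}-1,t-1) + R(x_{t-1}+1,t-1) \bigr]
\end{align*}
```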

Theorem 2 implies that the optimized $g_0$ from LP is the value of the
CSEG, as well as providing probabilistic interpretations for the dual variables
$q(j,1)$, $v(j,k,t+1)$, and $q(j,T)$. Thus the value of the game and both optimal
strategies can be obtained from LP.

Bothwell (1990) reports on some experiments in using LP as above (as well
as other methods) to solve a one-dimensional CSEG where $A(i,j,t)$ indicates
whether $i = j$, so that the payoff is "total number of coincidences," with $S_0 = \{1\}$
and $E_0 = \{N\}$. He discovered that the new formulation permitted solutions in
about one fourth of the time of the Eagle-Washburn method, and was thus
able to solve games up to $N = 30$. His Figures 1-6 describe the solution for
$N = 20$ and $T = 31$. The searcher's strategy $p(\cdot,\cdot)$ is shown digitally in Figure 1
and graphically in Figure 2. Figure 3 is a blowup for $t \ge 21$, showing that
$p(\cdot,31)$ is finally uniform, that $p(1,t)$ goes through a maximum, and that $p(20,t)$
goes through a minimum. The latter two features were unanticipated, but
seem to be regular features of the solution for large $N$. Basically the searcher
"rushes" from cell 1 to cell 20, except that he has a small probability of
reversing his direction after time 10. The cumulative effect of all the small
probabilities is to make $p(\cdot,t)$ uniform for $t = 31$.

Figures 4-6 show the evader's marginal probabilities $q(\cdot,\cdot)$. Basically the
evader stays in cell 20, except that there is at all times (even $t = 1$) a small
probability of making a break for the other side; one is reminded of Auger's
(1991) "Wait-and-run" strategies. By time 31 the evader's position, like the
searcher's, is uniform over all 20 cells. It follows (see Eagle and Washburn)
that the game where $T > 31$ starts the same way as when $T = 31$, but that it is
optimal for each player to remain stationary for $31 \le t \le T$.

Figure 1. Searcher Marginal Probabilities (x1000) for 20-Cell CSEG

Figure 2. Searcher Strategy

Figure 3. Searcher Strategy - Final Time Periods

Figure 4. Evader Marginal Probabilities (x1000) for 20-Cell CSEG

Figure 5. Evader Strategy

Figure 6. Evader Strategy - Final Time Periods

3.  GENERALIZED  PAYOFF 

In this section it will be shown that the payoff in a CSEG can be
generalized to $\sum_{t=1}^{T} A(x_{t-1}, x_t, y_{t-1}, y_t, t)$, with $x_0$ and $y_0$ specified. Solution of
such games will require retention of the joint occupancy probabilities, so the
contribution of this section is toward modeling flexibility, rather than
computational efficiency.

Let $S_0 = \{x_0\}$, $E_0 = \{y_0\}$, and let $S(\cdot,\cdot)$ and $E(\cdot,\cdot)$ be as defined in Section 1
except that $S(x_0,0)$ and $E(y_0,0)$ are now (rather than $S_0$ and $E_0$) the sets of cells
feasible for searcher and evader at time 1. $S_0$ and $E_0$ are now the (singleton)
sets of cells feasible at time 0. For $t \ge 1$ let $S_t$ be the set of cells feasible for the
searcher at time $t$. Formally, $S_t = \{j : \text{there exists } i \text{ in } S_{t-1} \text{ such that } j \text{ is in } S(i,t-1)\}$.
Define $E_t$ similarly. Also, for $t \ge 1$ and $j \in S_t$, let $S^*(j,t)$ be the set of
cells from which $j$ is feasible, formally $S^*(j,t) = \{i : j \in S(i,t-1)\}$, and define
$E^*(\cdot,\cdot)$ similarly. Finally, let $u(\cdot,\cdot,\cdot)$ be as defined in Section 1, so that

\[
f(m,n,t) \equiv \sum_{i \in S_{t-1}} \sum_{j \in S(i,t-1)} A(i,j,m,n,t)\,u(i,j,t); \quad 1 \le t \le T \tag{5}
\]

is the penalty at time $t$ to the evader if he occupies cell $m$ at time $t-1$ and cell $n$
at time $t$, and $\sum_{t=1}^{T} f(y_{t-1}, y_t, t)$ is the total expected penalty, conditioned on
the evader's track.


Consider first the evader's problem of minimizing the total penalty when
$u(\cdot,\cdot,\cdot)$ is known. A dynamic programming recursion is still feasible. Let
$h(m,t)$ be the minimum total penalty over periods $t, \ldots, T$ if the evader
occupies cell $m$ at time $t-1$. Then $h(m,t)$ satisfies the recursion

\[
h(m,t) = \min_{n \in E(m,t-1)} \{ f(m,n,t) + h(n,t+1) \}; \quad 1 \le t \le T,\ m \in E_{t-1} \tag{6}
\]

with $h(\cdot,T+1) \equiv 0$. The minimized total penalty over all $T$ periods is then
$h(y_0,1)$, which quantity the searcher wants to maximize. Since (6) can be
written as linear constraints, maximizing $h(y_0,1)$ is a linear program. The
program, with dual variables named in braces as usual, is LP1:

\[
\begin{array}{lll}
\text{maximize } h(y_0,1) & & \\
\text{subject to} & & \\
-f(m,n,t) - h(n,t+1) + h(m,t) \le 0 & ;\ 1 \le t \le T,\ m \in E_{t-1},\ n \in E(m,t-1) & \{v(m,n,t)\} \\
\sum_{j \in S_1} u(x_0,j,1) = 1 & & \{g(x_0,1)\} \\
-\sum_{j \in S^*(i,t)} u(j,i,t) + \sum_{k \in S(i,t)} u(i,k,t+1) = 0 & ;\ 1 \le t < T,\ i \in S_t & \{g(i,t+1)\} \\
u(i,j,t) \ge 0 & ;\ 1 \le t \le T,\ i \in S_{t-1},\ j \in S(i,t-1). &
\end{array}
\]

In LP1, $f(m,n,t)$ has been written for compactness, even though the expression
on the right-hand side of (5) is meant, and it should be understood that the
term $h(n,t+1)$ is missing when $t = T$. The second and third sets of constraints
are the feasibility constraints of Eagle and Washburn; as long as $u(\cdot,\cdot,\cdot)$
satisfies those constraints, there exists a feasible mixed strategy for the searcher
with $u(\cdot,\cdot,\cdot)$ as the joint occupancy probabilities. Thus any feasible solution
to LP1 corresponds to a lower bound $h(y_0,1)$ on the value of the CSEG, and
consequently the same thing can be said of the maximized value.
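For a fixed searcher strategy, recursion (6) can be evaluated directly without any LP. The sketch below does so with a memoized recursive function; the name `min_total_penalty`, the 3-cell neighborhood function `E`, the deterministic searcher encoded in `u`, and the coincidence payoff `A` are all hypothetical choices made for illustration.

```python
from functools import lru_cache

def min_total_penalty(f, E, y0, T):
    """Recursion (6): h(m, t) = min over n in E(m, t-1) of f(m, n, t) + h(n, t+1)."""
    @lru_cache(maxsize=None)
    def h(m, t):
        if t == T + 1:
            return 0.0                      # boundary condition h(., T+1) = 0
        return min(f(m, n, t) + h(n, t + 1) for n in E(m, t - 1))
    return h(y0, 1)

N, T = 3, 2
E = lambda m, t: [n for n in (m - 1, m, m + 1) if 1 <= n <= N]

# Hypothetical searcher: the pure path 1 -> 2 -> 3, i.e. u(1,2,1) = u(2,3,2) = 1.
u = {(1, 2, 1): 1.0, (2, 3, 2): 1.0}
A = lambda i, j, m, n, t: 1.0 if j == n else 0.0   # coincidence payoff only

def f(m, n, t):
    """Equation (5): expected penalty for the evader move m -> n at time t."""
    return sum(prob * A(i, j, m, n, t) for (i, j, s), prob in u.items() if s == t)

value = min_total_penalty(f, E, y0=3, T=T)
print(value)
```

With a payoff that only counts coincidences, the evader starting in cell 3 can wait one period and then step past the searcher, so the minimized penalty in this small example is zero.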

LP1 and its dual DLP1 possess a pleasing symmetry that was absent in
Section 2. DLP1 is (the $g(j,t+1)$ term is missing when $t = T$)

\[
\begin{array}{lll}
\text{minimize } g(x_0,1) & & \\
\text{subject to} & & \\
g(i,t) - g(j,t+1) - \sum_{m \in E_{t-1}} \sum_{n \in E(m,t-1)} A(i,j,m,n,t)\,v(m,n,t) \ge 0 & ;\ 1 \le t \le T,\ i \in S_{t-1},\ j \in S(i,t-1) & \{u(i,j,t)\} \\
\sum_{n \in E_1} v(y_0,n,1) = 1 & & \{h(y_0,1)\} \\
-\sum_{n \in E^*(m,t)} v(n,m,t) + \sum_{k \in E(m,t)} v(m,k,t+1) = 0 & ;\ 1 \le t < T,\ m \in E_t & \{h(m,t+1)\} \\
v(m,n,t) \ge 0 & ;\ 1 \le t \le T,\ m \in E_{t-1},\ n \in E(m,t-1). &
\end{array}
\]

Any function $v(\cdot,\cdot,\cdot)$ that meets the second and third sets of constraints of
DLP1 can be interpreted as the joint occupancy probabilities of a feasible
mixed strategy for the evader. That being the case, the first set of constraints
assures that a searcher in cell $i$ at time $t-1$ cannot obtain a payoff larger than
$g(i,t)$ over periods $t, \ldots, T$. In particular, $g(x_0,1)$ is an upper bound on the
cumulative payoff over all $T$ periods. But the optimized values of $g(x_0,1)$ and
$h(y_0,1)$ must be equal because LP1 and DLP1 are duals, so either number is the
value of the CSEG. Furthermore the evader's optimal occupancy
probabilities can be obtained as the dual variables associated with the first set
of constraints in LP1; it is actually not necessary to solve DLP1.


Example:  The  revised  one-dimensional  CSEG 

In the standard one-dimensional CSEG described earlier, it is possible
that the two tracks $x_1, \ldots, x_T$ and $y_1, \ldots, y_T$ may cross each other without ever
being exactly coincident, in which case the searcher's score will be 0 because
the objective function simply counts coincidences. To guard against this
possibility, the searcher's leading edge as he moves from 1 to $N$ is spread into
two approximately equal parts, thus making a barrier so wide that the evader
cannot "jump over it" (see Figure 2). This annoying artifact can be eliminated
by redefining the payoff so that the searcher scores a point whenever the two
tracks cross, even if they are never exactly coincident. Specifically, for
$1 \le i, j, m, n \le N$ let

\[
A(i,j,m,n,t) =
\begin{cases}
1 & \text{if } j = n \\
1 & \text{if } i = n \text{ and } j = m \\
0 & \text{otherwise}
\end{cases}
\tag{7}
\]
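The difference between the two payoffs is easiest to see on a pair of pure tracks. The sketch below scores one searcher track against one evader track under both definitions; the function names and the two 4-point tracks are assumptions chosen for illustration (index 0 of each list holds the time-0 cell).

```python
def crossing_payoff(i, j, m, n, t=None):
    """Payoff (7): searcher moves i -> j, evader moves m -> n during period t."""
    if j == n:
        return 1          # both end the period in the same cell
    if i == n and j == m:
        return 1          # the players exchange cells, so the tracks cross
    return 0

def standard_payoff(i, j, m, n, t=None):
    """Coincidence-only payoff of the standard game."""
    return 1 if j == n else 0

def track_score(payoff, x, y):
    """Cumulative payoff for tracks x[0..T] and y[0..T]."""
    return sum(payoff(x[t - 1], x[t], y[t - 1], y[t], t) for t in range(1, len(x)))

# Hypothetical tracks: the searcher sweeps right while the evader runs left.
x = [4, 5, 6, 7]
y = [7, 6, 5, 4]

standard = track_score(standard_payoff, x, y)
revised = track_score(crossing_payoff, x, y)
print(standard, revised)
```

The two tracks exchange cells 5 and 6 in the second period without ever coinciding, so the standard payoff scores nothing while payoff (7) scores one point.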


Figure 7 shows a GAMS program (Brooke, Kendrick, and Meeraus, 1988) to
solve a 10 cell CSEG with payoff (7) where the initial moves of searcher and
evader are from cells 4 to 5 and 7 to 6, respectively. Figure 8 shows the
associated output. The value of the CSEG is 1.2269 (scaled to 122.69 in Figure
8), to be compared with .8541 in the "standard" game where $A(i,j,m,n,t)$
simply indicates whether $j = n$. The prohibition of scoreless crossovers is
evidently a significant change in the rules of the game. Note that the leading
edge of the searcher's marginals (P) is now only 1 cell wide for $1 \le t \le 6$.

Figure 7. One-Dimensional Crossing Game

Figure 8. One-Dimensional Crossing Game

The revised game differs qualitatively in an interesting way from the
standard game. Let $v_N(T)$ and $v'_N(T)$ be the values of the standard and revised
games (so $v_{10}(10) = .8541$ and $v'_{10}(10) = 1.2269$). $v_N(T)$ is ultimately linear in $T$
with slope $1/N$. For example $v_{10}(T) = .8541 + (T-10)/10$ for $T \ge 10$. The
turnpike theorem of Eagle and Washburn makes this plausible; essentially
either side can guarantee a slope of $1/N$ by remaining stationary in a
randomly chosen cell. Stationarity has the same virtues in the revised game,
but there is no evidence that $v'_N(T)$ is ultimately linear. For
$T = (12, 14, 16, 18, 20)$, $v'_{10}(T)$ is (1.4486, 1.7109, 2.000, 2.1396, 2.3540). The
differences fluctuate about .2, but are never exactly equal to .2. It is possible,
of course, that $T = 20$ is simply not large enough to observe the onset of
linearity.
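The remark about the differences can be checked with a few lines of arithmetic. The values below are the revised-game values quoted above for the 10-cell game, with the T = 10 value 1.2269 included as the starting point; the variable names are illustrative.

```python
T_values = [10, 12, 14, 16, 18, 20]
revised = [1.2269, 1.4486, 1.7109, 2.000, 2.1396, 2.3540]

# Increase in game value over each two-period step.
increments = [round(b - a, 4) for a, b in zip(revised, revised[1:])]

# A value that is linear with slope 1/N = 1/10 would gain this much per step.
turnpike_step = (T_values[1] - T_values[0]) / 10

for T, inc in zip(T_values[1:], increments):
    print(f"T={T}: increment {inc:.4f}, linear growth would give {turnpike_step:.1f}")
```

The five increments are .2217, .2623, .2891, .1396, and .2144, none of which equals the .2 that a slope of 1/10 would produce over two periods.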